Abstract

Let $K_n$ denote the number of types of a sample of size $n$ taken from an exchangeable coalescent process ($\Xi$-coalescent) with mutation. A distributional recursion for the sequence $(K_n)_{n\in\mathbb{N}}$ is derived. If the coalescent does not have proper frequencies, i.e., if the characterizing measure $\Xi$ on the infinite simplex $\Delta$ does not have mass at zero and satisfies $\int_\Delta |x|\,\Xi(dx)/(x,x) < \infty$, where $|x| := \sum_{i=1}^\infty x_i$ and $(x,x) := \sum_{i=1}^\infty x_i^2$ for $x = (x_1, x_2, \ldots) \in \Delta$, then $K_n/n$ converges weakly as $n \to \infty$ to a limiting variable $K$ which is characterized by an exponential integral of the subordinator associated with the coalescent process. For so-called simple measures $\Xi$ satisfying $\int_\Delta \Xi(dx)/(x,x) < \infty$ we characterize the distribution of $K$ via a fixed-point equation.
1 Introduction and main results

Exchangeable coalescents are Markovian processes with state space $\mathcal{E}$, the set of equivalence relations (partitions) on $\mathbb{N} := \{1, 2, \ldots\}$, with a block merging mechanism. The class of exchangeable coalescents with multiple collisions has been independently introduced by Pitman [23] and Sagitov [24]. These processes can be characterized by a finite measure $\Lambda$ on the unit interval $[0, 1]$ and are hence also called $\Lambda$-coalescents. The best known example is the Kingman coalescent, where $\Lambda = \delta_0$ is the Dirac measure in 0. This coalescent allows only for binary mergers of ancestral lineages. Another well studied coalescent is the Bolthausen-Sznitman coalescent [5], where $\Lambda$ is the uniform distribution on $[0, 1]$. The full class of exchangeable coalescents allowing for simultaneous multiple collisions of ancestral lineages was discovered by Möhle and Sagitov [22] and Schweinsberg [26]. Schweinsberg [26] characterizes exchangeable coalescents via a finite measure $\Xi$ on the infinite simplex $\Delta := \{x = (x_1, x_2, \ldots) : x_1 \ge x_2 \ge \cdots \ge 0, \sum_{i=1}^\infty x_i \le 1\}$. For the following it is convenient to decompose $\Xi = a\delta_0 + \Xi_0$ with $a := \Xi(\{0\}) \in [0, \infty)$ and $\Xi_0$ having no atom at zero. Suppose that the coalescent is in a state with $n$ blocks. Then each $(k_1, \ldots, k_j)$-collision ($k_1, \ldots, k_j \in \mathbb{N}$ with $k_1 + \cdots + k_j = n$, $k_1 \ge \cdots \ge k_j$ and $k_1 \ge 2$) is occurring at the rate (see [26, Eq. (11)])

$$\phi_j(k_1, \ldots, k_j) = a\,1_{\{r=1,\,k_1=2\}} + \int_\Delta \sum_{l=0}^{s} \binom{s}{l} (1-|x|)^{s-l} \sum_{\substack{i_1,\ldots,i_{r+l}\in\mathbb{N} \\ \text{all distinct}}} x_{i_1}^{k_1} \cdots x_{i_{r+l}}^{k_{r+l}}\, \frac{\Xi_0(dx)}{(x,x)}, \tag{1}$$

where $s := |\{1 \le i \le j : k_i = 1\}|$, $r := j - s$, $|x| := \sum_{i=1}^\infty x_i$ and $(x,x) := \sum_{i=1}^\infty x_i^2$ for $x = (x_1, x_2, \ldots) \in \Delta$. Note that $\phi_1(2) = \Xi(\Delta)$.
For $n \in \mathbb{N}$ let $\varrho_n : \mathcal{E} \to \mathcal{E}_n$ denote the natural restriction to the set $\mathcal{E}_n$ of all equivalence relations on $\{1, \ldots, n\}$. Let $R = (R_t)_{t\ge 0}$ be a coalescent process with simultaneous multiple collisions. The restricted coalescent process $(\varrho_n R_t)_{t\ge 0}$ is usually interpreted as a genealogical tree of a sample of $n$ individuals.
In the biological context it is natural to introduce mutations into this model 
as follows. Assume that each individual has a certain type. Independently of 
the genealogical tree mutations occur along each branch of the tree according 
to a homogeneous Poisson process with rate r > 0. The infinitely many alleles 
model is assumed, i.e., each mutation leads to a new type never seen before in 
the sample. 

Recently there has been much interest in the study of functionals of restricted coalescent processes $(\varrho_n R_t)_{t\ge 0}$, for example the number of collisions [10], [14], [15], [16], the time back to the most recent common ancestor and the lengths of external branches [6], [8], [12], the total branch length [9] or the number of segregating sites [21].

Further typical quantities of interest are $K_i(n)$, the number of types which appear exactly $i$ times in a sample of size $n$, and the summary statistic $K_n := \sum_{i=1}^n K_i(n)$, the total number of types in the sample. The most celebrated result in this context is the Ewens sampling formula [11] for the distribution of the allele frequency spectrum $(K_1(n), \ldots, K_n(n))$ under the Kingman coalescent. Recently asymptotic results for the allele frequency spectrum have been obtained by Berestycki, Berestycki and Schweinsberg [2], [3] for beta$(2-\alpha, \alpha)$-coalescents with parameter $1 < \alpha < 2$ and by Basdevant and Goldschmidt [1] for the Bolthausen-Sznitman coalescent [5]. Here we are interested in the total number $K_n$ of types of a sample of size $n \in \mathbb{N}$ taken from a $\Xi$-coalescent with mutation rate $r > 0$. The motivation for our interest in $K_n$ is manifold. It is an observable quantity and hence important for biological and statistical applications. In combination with the results of [20] on the allele frequency spectrum and of [21] on the number of segregating sites, our study of $K_n$ gives additional insight into the structure of exchangeable coalescent trees. Our first result (Theorem 1.1 below) provides a distributional recursion for the sequence $(K_n)_{n\in\mathbb{N}}$. In order to state the result we need to introduce the rates

$$g_{nk} := \lim_{t\searrow 0} \frac{P(|\varrho_n R_t| = k)}{t}, \qquad n, k \in \mathbb{N},\ k < n, \tag{2}$$
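and the total rates

$$g_n := \lim_{t\searrow 0} \frac{P(|\varrho_n R_t| < n)}{t} = \sum_{k=1}^{n-1} g_{nk}, \qquad n \in \mathbb{N}. \tag{3}$$

The total rates $g_n$, $n \in \mathbb{N}$, can be expressed in terms of the measure $\Xi = a\delta_0 + \Xi_0$ as (see Schweinsberg [26, p. 36, Eq. (70)])

$$g_n = a\binom{n}{2} + \int_\Delta \Big(1 - \sum_{l=0}^{n} \binom{n}{l} (1-|x|)^{n-l} \sum_{\substack{i_1,\ldots,i_l\in\mathbb{N} \\ \text{all distinct}}} x_{i_1} \cdots x_{i_l}\Big) \frac{\Xi_0(dx)}{(x,x)}. \tag{4}$$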
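A similar argument shows that the rates (2) are given as

$$g_{nk} = a\binom{n}{2} 1_{\{k=n-1\}} + \int_\Delta \sum_{j=1}^{k} f_{nkj}(x)\, \frac{\Xi_0(dx)}{(x,x)} \tag{5}$$

with

$$f_{nkj}(x) := \frac{n!}{j!\,(k-j)!}\,(1-|x|)^{k-j} \sum_{\substack{i_1,\ldots,i_j\in\mathbb{N} \\ \text{all distinct}}}\ \sum_{\substack{n_1,\ldots,n_j\in\mathbb{N} \\ n_1+\cdots+n_j=n-k+j}} \frac{x_{i_1}^{n_1} \cdots x_{i_j}^{n_j}}{n_1! \cdots n_j!}$$

for $n, k \in \mathbb{N}$ with $k < n$ and $j \in \{1, \ldots, k\}$. The $\Lambda$-coalescent occurs if the measure $\Xi$ is concentrated on the points $x = (u, 0, 0, \ldots) \in \Delta$ with $u \in [0, 1]$ and can hence be considered as a measure $\Lambda$ on the unit interval $[0, 1]$. In this case only the index $j = 1$ contributes to the sum below the integral in (5), and from $f_{nk1}(u, 0, 0, \ldots) = \binom{n}{k-1} (1-u)^{k-1} u^{n-k+1}$ it follows that (5) takes the form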



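$$g_{nk} = \binom{n}{k-1} \int_{[0,1]} u^{n-k-1} (1-u)^{k-1}\, \Lambda(du), \qquad n, k \in \mathbb{N},\ k < n. \tag{6}$$

Similarly, for the $\Lambda$-coalescent the total rates (4) are given as

$$g_n = \int_{[0,1]} \frac{1 - (1-u)^n - nu(1-u)^{n-1}}{u^2}\, \Lambda(du), \qquad n \in \mathbb{N}. \tag{7}$$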
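As a sanity check on the two rate formulas for $\Lambda$-coalescents, the following minimal sketch evaluates them for a beta$(a, b)$ measure $\Lambda$ with integer parameters $a \ge 3$, $b \ge 1$ (an assumption made here only so that every integral is a beta integral with an exact rational value). The function names `beta_fn`, `rate` and `total_rate` are illustrative and not taken from the paper.

```python
from fractions import Fraction
from math import comb, factorial


def beta_fn(p: int, q: int) -> Fraction:
    """Beta function B(p, q) for positive integers p, q, as an exact fraction."""
    return Fraction(factorial(p - 1) * factorial(q - 1), factorial(p + q - 1))


def rate(n: int, k: int, a: int, b: int) -> Fraction:
    """g_{nk} for Lambda = beta(a, b): binom(n, k-1) * B(a+n-k-1, b+k-1) / B(a, b)."""
    return comb(n, k - 1) * beta_fn(a + n - k - 1, b + k - 1) / beta_fn(a, b)


def total_rate(n: int, a: int, b: int) -> Fraction:
    """g_n for Lambda = beta(a, b), a >= 3, integrating the three terms separately."""
    return (beta_fn(a - 2, b) - beta_fn(a - 2, b + n)
            - n * beta_fn(a - 1, b + n - 1)) / beta_fn(a, b)


if __name__ == "__main__":
    n, a, b = 5, 3, 1
    print([rate(n, k, a, b) for k in range(1, n)], total_rate(n, a, b))
```

Summing `rate(n, k, a, b)` over $k = 1, \ldots, n-1$ reproduces `total_rate(n, a, b)`, in line with the definition of $g_n$ as the sum of the $g_{nk}$.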

Our first main result is the following distributional recursion for the number of types $K_n$.

Theorem 1.1 The sequence $(K_n)_{n\in\mathbb{N}}$ satisfies the distributional recursion $K_1 = 1$ and

$$K_n \overset{d}{=} B_n (K_{n-1} + 1) + (1 - B_n) K_{I_n}, \qquad n \in \{2, 3, \ldots\}, \tag{8}$$

where $B_n$ is a Bernoulli variable independent of $(K_2, \ldots, K_{n-1}, I_n)$ with distribution

$$P(B_n = 1) = 1 - P(B_n = 0) = \frac{nr}{g_n + nr}, \qquad n \in \mathbb{N},$$

and $I_n$ is a random variable independent of $(K_2, \ldots, K_{n-1})$ with distribution

$$r_{nk} := P(I_n = k) = \frac{g_{nk}}{g_n}, \qquad n, k \in \mathbb{N},\ k < n. \tag{9}$$

Note that $I_n$ is the number of equivalence classes (blocks) of the restricted coalescent process $(\varrho_n R_t)_{t\ge 0}$ after its first jump.

The proof of Theorem 1.1 given in Section 2 involves a combination of what Kingman [18] calls natural coupling and temporal coupling. The main argument of the proof is the same as that used in [20] and [21] for deriving similar recursions for the allele frequency spectrum and the number of segregating sites. The recursion for the summary statistic $K_n$ is simpler than that for the allele frequency spectrum presented in [20]. It is therefore better suited for computing the distribution of $K_n$, and related functionals of this distribution, for moderate values of $n$ in reasonable time. Moreover, Theorem 1.1 is valid for any $\Xi$-coalescent.
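To illustrate how the recursion of Theorem 1.1 can be used numerically, the following sketch conditions on $B_n$ and $I_n$ and computes $P(K_n = k)$ for all $n$ up to a given bound in exact rational arithmetic. The function names and the calling convention (`rates(n, k)` returning $g_{nk}$) are assumptions made for this illustration; the Kingman rates serve as a test case.

```python
from fractions import Fraction


def type_distribution(n_max, r, rates):
    """Return dist with dist[n][k] = P(K_n = k) for 1 <= k <= n <= n_max.

    r is the mutation rate and rates(n, k) returns g_{nk} for 1 <= k < n.
    """
    dist = {1: {1: Fraction(1)}}
    for n in range(2, n_max + 1):
        g = {i: Fraction(rates(n, i)) for i in range(1, n)}
        denom = sum(g.values()) + n * r          # g_n + n r
        row = {}
        for k in range(1, n + 1):
            # first event is a mutation: one type is removed together with one individual
            mutation = n * r * dist[n - 1].get(k - 1, 0)
            # first event is a coalescence leaving i blocks: number of types unchanged
            coalescence = sum(g[i] * dist[i].get(k, 0) for i in range(k, n))
            row[k] = (mutation + coalescence) / denom
        dist[n] = row
    return dist


def kingman_rates(n, k):
    """Kingman coalescent: only binary mergers, g_{n,n-1} = n(n-1)/2."""
    return Fraction(n * (n - 1), 2) if k == n - 1 else Fraction(0)


if __name__ == "__main__":
    print(type_distribution(3, Fraction(1, 2), kingman_rates)[3])
```

Each row requires a sum over the smaller sample sizes, so the whole table up to `n_max` costs on the order of `n_max**3` rational operations.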

Our second result (Theorem 1.2 below) concerns measures $\Xi$ satisfying

$$\Xi(\{0\}) = 0 \quad\text{and}\quad \int_\Delta \frac{|x|}{(x,x)}\, \Xi(dx) < \infty. \tag{10}$$

Recall that $|x| := \sum_{i=1}^\infty x_i$ and that $(x,x) := \sum_{i=1}^\infty x_i^2$ for $x = (x_1, x_2, \ldots) \in \Delta$. Note that (10) prevents $\Xi$ from having too much mass near zero. Schweinsberg [26, Prop. 30] showed that the $\Xi$-coalescent does not have proper frequencies if and only if (10) holds. Not having proper frequencies is equivalent to having a positive fraction of singleton blocks with positive probability, which is the property most important for our convergence result presented in Theorem 1.2 below. For the special class of coalescent processes with multiple collisions ($\Lambda$-coalescents), Eq. (10) takes the form

$$\Lambda(\{0\}) = 0 \quad\text{and}\quad \int_{[0,1]} u^{-1}\, \Lambda(du) < \infty. \tag{11}$$

Pitman [23, Theorem 8] already showed that the $\Lambda$-coalescent does not have proper frequencies if and only if (11) holds. Condition (11) excludes important examples such as the Kingman coalescent and the Bolthausen-Sznitman coalescent [5]. However, it includes for example all beta$(a, b)$-coalescents with parameters $a > 1$ and $b > 0$, which are studied in more detail in Section 5. Note that Theorem 1.2 covers a substantial class of $\Xi$-coalescents.

Theorem 1.2 Suppose that the characterizing measure $\Xi$ of the exchangeable coalescent $(R_t)_{t\ge 0}$ satisfies (10). Then $K_n/n$ converges weakly as $n \to \infty$ to $K := r \int_0^\infty e^{-rt} e^{-X_t}\, dt$, where $X = (X_t)_{t\ge 0}$ is a subordinator with Laplace exponent

$$\Phi(\eta) = \int_\Delta \frac{1 - (1-|x|)^\eta}{(x,x)}\, \Xi(dx), \qquad \eta \ge 0.$$

The limiting variable $K$ has moments

$$E(K^j) = \frac{r^j\, j!}{(r + \Phi(1))(2r + \Phi(2)) \cdots (jr + \Phi(j))}, \qquad j \in \mathbb{N}. \tag{12}$$
We will see that the subordinator $X$ appearing in Theorem 1.2 is related to the frequency $S_t$ of singletons of $R_t$ via $X_t = -\log S_t$, $t \ge 0$. Our proof of Theorem 1.2 is not based on the recursion presented in Theorem 1.1. It is rather a consequence of the chain of inequalities

$$M_n \le K_n \le N_n + 1, \tag{13}$$

where $M_n$ denotes the number of mutated external branches and $N_n$ denotes the total number of mutated branches of the restricted coalescent tree $(\varrho_n R_t)_{t\ge 0}$. Here we call a branch mutated if it is affected by at least one mutation. In a first step it is shown in Section 3 that Theorem 1.2 is valid with $K_n$ replaced by the lower bound $M_n$. Afterwards in Section 4 it is verified that $(N_n - M_n)/n \to 0$ in probability (even in $L^1$), which completes the proof of Theorem 1.2 and in addition shows that Theorem 1.2 remains valid with $K_n$ replaced by $N_n$. Note that, if $K_{n,1}$ denotes the number of types which appear exactly once in the sample of size $n$, then $M_n \le K_{n,1} \le K_n$, and, consequently, Theorem 1.2 also remains valid with $K_n$ replaced by $K_{n,1}$.
Theorem 1.2 leaves open the question about the asymptotic behavior of $K_n$ for the important class of $\Xi$-coalescents which do not satisfy condition (10). As mentioned before, some results for particular $\Lambda$-coalescents are known ([1], [2], [3], [11], [20]); however, the problem concerning the asymptotic behavior of $K_n$ for the full class of $\Xi$-coalescents remains open.

2 A recursion for the number of types 

The proof of Theorem 1.1 is based on two fundamental properties of coalescent processes which Kingman [18] calls natural coupling and temporal coupling.
Natural coupling states the following. Suppose a genealogy of a sample of size $n \in \mathbb{N}$ governed by a $\Xi$-coalescent is given. If a sub-sample of size $m \in \{1, \ldots, n-1\}$ of this sample is taken, i.e., if $n - m$ individuals are removed from the sample, then the genealogical tree of the remaining sample of size $m$ is governed by the same $\Xi$-coalescent. This consistency relation between different sample sizes is one of the fundamental properties of exchangeable coalescents. It is in fact needed in order to prove the existence of exchangeable coalescent processes with state space $\mathcal{E}$ via Kolmogoroff's extension theorem.

The second property, called temporal coupling, states the following. Consider a restricted coalescent process $(R_t^{(n)})_{t\ge 0} := (\varrho_n R_t)_{t\ge 0}$ and let $T_n := \inf\{t > 0 : R_t^{(n)} \ne R_0^{(n)}\}$ denote the time of its first jump. If individuals which belong to the same equivalence class after that first jump are identified, then the process started at time $T_n$ is distributed as a coalescent with sample size $|R_{T_n}^{(n)}|$. Mathematically this property essentially boils down to the strong Markov property. We will now verify Theorem 1.1.

Proof of Theorem 1.1 The recursion (8) is equivalent to $P(K_1 = 1) = 1$ and

$$P(K_n = k) = \frac{nr}{g_n + nr}\, P(K_{n-1} = k-1) + \frac{g_n}{g_n + nr} \sum_{i=k}^{n-1} r_{ni}\, P(K_i = k) \tag{14}$$

for $n \in \{2, 3, \ldots\}$ and $k \in \{1, \ldots, n\}$. We verify (14) in analogy to the proofs presented in [20] by looking at the first event (either a coalescence or a mutation) which happens backwards in time.

The time $W_n$ back to the first mutation is exponentially distributed with parameter $nr$. The time $T_n$ back to the first coalescence is independent of $W_n$ and exponentially distributed with parameter $g_n$. Thus, the first event backwards in time is a mutation with probability $P(W_n < T_n) = nr/(g_n + nr)$, and a coalescence with the complementary probability $P(T_n < W_n) = g_n/(g_n + nr)$. Note that these two probabilities appear on the right hand side in (14).
Assume that the first event backwards in time is a mutation. If we disregard the individual which is affected by this mutation, the number of types decreases by one. Moreover, from the natural coupling property it follows that the remaining tree is distributed as a coalescent restricted to the set $\{1, \ldots, n-1\}$. This argument explains the appearance of the probability $P(K_{n-1} = k-1)$ on the right hand side in (14).

If the first event backwards in time is a coalescence, then at the time of that coalescence event, the coalescent process jumps to a partition with $i$ blocks, $i \in \{1, \ldots, n-1\}$, with probability $r_{ni} = g_{ni}/g_n$. By the temporal coupling property, the coalescent process stopped at that time is distributed as a coalescent restricted to the set $\{1, \ldots, i\}$. As the number of types is not affected by a coalescence, the appearance of the sum on the right hand side in (14) is explained. Note that it suffices to run the sum from $k$ to $n-1$ as $P(K_i = k) = 0$ for $i < k$. □

Remarks. 1. In terms of the generating function $f_n(s) := E(s^{K_n})$, $n \in \mathbb{N}$, $s \in \mathbb{C}$, the recursion (8) (or (14)) is equivalent to $f_1(s) = s$ and

$$(g_n + nr) f_n(s) = nrs\, f_{n-1}(s) + \sum_{k=1}^{n-1} g_{nk} f_k(s), \qquad n \in \{2, 3, \ldots\},\ s \in \mathbb{C}, \tag{15}$$

a formula which follows (at least for coalescent processes with multiple collisions) also by taking $s_1 = \cdots = s_n =: s$ in Eq. (4) of [19].

2. The recursion (14) for the distribution of $K_n$ is useful to compute the probabilities $P(K_n = k)$ successively for $k = n, n-1, \ldots, 1$. For example, for $k = n$ it follows that $(g_n + nr) P(K_n = n) = nr P(K_{n-1} = n-1)$ and, therefore,

$$P(K_n = n) = \frac{n!\, r^{n-1}}{\prod_{i=2}^{n} (g_i + ir)}, \qquad n \in \mathbb{N}.$$

Note that $P(K_n = n)$ is the probability to have only singletons in the sample of size $n$.

Example. (Kingman coalescent) For the Kingman coalescent ($\Lambda = \delta_0$) we have $I_n = n - 1$, $g_n = g_{n,n-1} = n(n-1)/2$ and $g_{ni} = 0$ for $i \in \{1, \ldots, n-2\}$. The recursion (8) reduces to $K_n \overset{d}{=} B_n + K_{n-1}$. Therefore, $K_n \overset{d}{=} \sum_{i=1}^n B_i$, $n \in \mathbb{N}$, where $B_1, B_2, \ldots$ are independent Bernoulli variables with $P(B_n = 1) = nr/(g_n + nr) = \theta/(\theta + n - 1)$, $n \in \mathbb{N}$, with $\theta := 2r$. It follows easily that $P(K_n = k) = \theta^k s(n,k)/[\theta]_n$, where $[\theta]_n := \theta(\theta+1) \cdots (\theta+n-1)$ and the $s(n,k)$ denote the absolute Stirling numbers of the first kind. Moreover, $E(K_n) = \theta \sum_{i=0}^{n-1} 1/(\theta + i) \sim \theta \log n$ and $\mathrm{Var}(K_n) = \theta \sum_{i=1}^{n-1} i/(\theta + i)^2 \sim \theta \log n$. By the Lindeberg-Feller central limit theorem, $(K_n - \theta \log n)/\sqrt{\theta \log n}$ is asymptotically standard normal distributed. All these results are of course well known and go back at least to the seminal work of Ewens [11].
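The closed form for the Kingman case is easy to evaluate. The sketch below builds the absolute Stirling numbers of the first kind from their standard recurrence $s(n,k) = s(n-1,k-1) + (n-1)\,s(n-1,k)$ and returns the probabilities $\theta^k s(n,k)/[\theta]_n$ as exact fractions; the function names are illustrative only.

```python
from fractions import Fraction


def stirling_first_abs(n_max):
    """Table c[n][k] of absolute Stirling numbers of the first kind, 0 <= k <= n <= n_max."""
    c = [[0] * (n_max + 1) for _ in range(n_max + 1)]
    c[0][0] = 1
    for n in range(1, n_max + 1):
        for k in range(1, n + 1):
            c[n][k] = c[n - 1][k - 1] + (n - 1) * c[n - 1][k]
    return c


def kingman_type_distribution(n, theta):
    """P(K_n = k) = theta^k s(n, k) / [theta]_n for k = 1, ..., n (theta a Fraction)."""
    c = stirling_first_abs(n)
    rising = Fraction(1)
    for i in range(n):
        rising *= theta + i                      # [theta]_n = theta (theta+1) ... (theta+n-1)
    return {k: theta**k * c[n][k] / rising for k in range(1, n + 1)}


if __name__ == "__main__":
    print(kingman_type_distribution(3, Fraction(1)))
```

The mean of the returned distribution agrees with $\theta \sum_{i=0}^{n-1} 1/(\theta + i)$, as it must for a sum of independent Bernoulli variables with success probabilities $\theta/(\theta + i)$.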

Example. (Star-shaped coalescent) For the star-shaped coalescent ($\Lambda = \delta_1$) we have $I_n = 1$, $g_{n1} = g_n = 1$ and $g_{ni} = 0$ for $i \in \{2, \ldots, n-1\}$. Therefore, (15) reduces to $(1 + nr) f_n(s) = nrs\, f_{n-1}(s) + s$, $n \in \{2, 3, \ldots\}$, $s \in \mathbb{C}$. We refer to [20, Section 4] for more details. In particular, in [20] it is shown that $K_n/n$ converges almost surely to a limiting random variable $K$, beta distributed with parameters 1 and $1/r$, that is $P(K > x) = (1 - x)^{1/r}$, $0 < x < 1$.
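Differentiating the generating-function recursion of the star-shaped coalescent at $s = 1$ gives $(1 + nr)\mu_n = nr(\mu_{n-1} + 1) + 1$ for the mean $\mu_n := E(K_n)$, with $\mu_1 = 1$. The sketch below (a numerical illustration, with a function name chosen here) iterates this recursion; $\mu_n/n$ should approach the mean $r/(1+r)$ of a beta distribution with parameters 1 and $1/r$.

```python
def star_shaped_mean(n_max, r):
    """E(K_n) for the star-shaped coalescent at n = n_max, by iterating the mean recursion."""
    mu = 1.0                                     # mu_1 = 1
    for n in range(2, n_max + 1):
        mu = (n * r * (mu + 1.0) + 1.0) / (1.0 + n * r)
    return mu


if __name__ == "__main__":
    r = 0.5
    for n in (10, 100, 1000):
        print(n, star_shaped_mean(n, r) / n, r / (1.0 + r))
```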

Remark. (Recursion for the factorial moments of $K_n$) Taking the $j$th derivative with respect to $s$ in (15) and applying the Leibniz rule yields

$$(g_n + nr) f_n^{(j)}(s) = nr\big(s f_{n-1}^{(j)}(s) + j f_{n-1}^{(j-1)}(s)\big) + \sum_{k=1}^{n-1} g_{nk} f_k^{(j)}(s)$$

for $n \in \{2, 3, \ldots\}$, $j \in \mathbb{N}$ and $s \in \mathbb{C}$. For $n \in \mathbb{N}$ and $j \in \mathbb{N}$ let $\mu_n^{(j)} := E((K_n)_j) = E(K_n (K_n - 1) \cdots (K_n - j + 1))$ denote the $j$th descending factorial moment of $K_n$. Taking the limit $s \to 1$ it follows that

$$(g_n + nr) \mu_n^{(j)} = nr\big(\mu_{n-1}^{(j)} + j \mu_{n-1}^{(j-1)}\big) + \sum_{k=1}^{n-1} g_{nk} \mu_k^{(j)}, \qquad n \in \{2, 3, \ldots\},\ j \in \mathbb{N}.$$

This recursion with initial condition $\mu_1^{(j)} = \delta_{j1}$ (Kronecker symbol) is useful to compute the factorial moments of $K_n$. For example, for $j = n$ we have $(g_n + nr) \mu_n^{(n)} = n^2 r \mu_{n-1}^{(n-1)}$ and, therefore,

$$\mu_n^{(n)} = \frac{r^{n-1} (n!)^2}{\prod_{i=2}^{n} (g_i + ir)}, \qquad n \in \mathbb{N},$$

a result which also follows from $\mu_n^{(n)} = n!\, P(K_n = n)$. In particular, the first moment $\mu_n := \mu_n^{(1)} = E(K_n)$ follows the recursion $\mu_1 = 1$ and

$$(g_n + nr) \mu_n = nr(\mu_{n-1} + 1) + \sum_{k=1}^{n-1} g_{nk} \mu_k, \qquad n \in \{2, 3, \ldots\}. \tag{16}$$

It seems to be non-trivial to solve any of these recursions except for the Kingman coalescent ($\Lambda = \delta_0$) and the star-shaped coalescent ($\Lambda = \delta_1$). We therefore focus on asymptotic results for $K_n$ as the sample size $n$ tends to infinity.



3 The number of mutated external branches 

We say that a branch of the restricted coalescent tree $(\varrho_n R_t)_{t\ge 0}$ is mutated if it is affected by at least one mutation. In this section we study the asymptotics of the number $M_n$ of mutated external branches of $(\varrho_n R_t)_{t\ge 0}$ under the assumption that the measure $\Xi$ satisfies the condition (10).

Lemma 3.1 Suppose that the characterizing measure $\Xi$ of the exchangeable coalescent process $R = (R_t)_{t\ge 0}$ satisfies (10). Then, $M_n/n \overset{d}{\to} M$ as $n \to \infty$, where $M$ is a random variable uniquely determined by its moments

$$E(M^k) = E\Big(\prod_{i=1}^{k} (1 - e^{-rL_i})\Big), \qquad k \in \mathbb{N},$$

with $L_i := \sup\{t \ge 0 : \{i\} \text{ is a block of } R_t\}$, $i \in \mathbb{N}$.

Proof. For $n \in \mathbb{N}$ and $i \in \{1, \ldots, n\}$ let

$$L_{n,i} := \sup\{t \ge 0 : \{i\} \text{ is a block of } \varrho_n R_t\}$$

denote the length of the $i$th external branch of the restricted coalescent tree $(\varrho_n R_t)_{t\ge 0}$. Fix $k \in \mathbb{N}$ and $t_1, \ldots, t_k \in [0, \infty)$. For $n \ge k$ we have

$$\begin{aligned}
P(L_{n,1} > t_1, \ldots, L_{n,k} > t_k)
&= P(\{1\} \text{ is a block of } \varrho_n R_{t_1}, \ldots, \{k\} \text{ is a block of } \varrho_n R_{t_k}) \\
&\to P\Big(\bigcap_{n\in\mathbb{N}} \big\{\{1\} \text{ is a block of } \varrho_n R_{t_1}, \ldots, \{k\} \text{ is a block of } \varrho_n R_{t_k}\big\}\Big) \\
&= P(\{1\} \text{ is a block of } R_{t_1}, \ldots, \{k\} \text{ is a block of } R_{t_k}) \\
&= P(L_1 > t_1, \ldots, L_k > t_k).
\end{aligned}$$

Thus, for all $k \in \mathbb{N}$, $(L_{n,1}, \ldots, L_{n,k}) \overset{d}{\to} (L_1, \ldots, L_k)$ as $n \to \infty$.
For $n \in \mathbb{N}$ and $i \in \{1, \ldots, n\}$ let $E_{n,i}$ denote the event that the $i$th external branch of the restricted tree $(\varrho_n R_t)_{t\ge 0}$ is affected by at least one mutation. Conditional on the lengths $L_{n,1}, \ldots, L_{n,n}$ of the external branches, the mutation Poisson process with parameter $r > 0$ acts independently on all these branches. Thus, for fixed $j \in \mathbb{N}$ we have

$$\begin{aligned}
P(E_{n,1} \cap \cdots \cap E_{n,j})
&= E\big(P(E_{n,1} \cap \cdots \cap E_{n,j} \mid L_{n,1}, \ldots, L_{n,j})\big) \\
&= E\big(P(E_{n,1} \mid L_{n,1}) \cdots P(E_{n,j} \mid L_{n,j})\big) \\
&= E\big((1 - e^{-rL_{n,1}}) \cdots (1 - e^{-rL_{n,j}})\big) \\
&\to E\big((1 - e^{-rL_1}) \cdots (1 - e^{-rL_j})\big).
\end{aligned}$$

From $M_n = \sum_{i=1}^{n} 1_{E_{n,i}}$ it follows that

$$E(M_n^k) = E\Big(\Big(\sum_{i=1}^{n} 1_{E_{n,i}}\Big)^k\Big) = \sum_{i_1,\ldots,i_k=1}^{n} E(1_{E_{n,i_1}} \cdots 1_{E_{n,i_k}}) = \sum_{i_1,\ldots,i_k\in\{1,\ldots,n\}} P(E_{n,i_1} \cap \cdots \cap E_{n,i_k}).$$
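For each fixed $n$, the events $E_{n,i}$, $i \in \{1, \ldots, n\}$, are exchangeable. Therefore,

$$E(M_n^k) = \sum_{j=1}^{k} S(k,j)\, (n)_j\, P(E_{n,1} \cap \cdots \cap E_{n,j}),$$

where $(n)_j := n(n-1) \cdots (n-j+1)$ and $S(k,j)$ denotes the Stirling number of the second kind, i.e. the number of ways to partition a set with $k$ elements into $j$ non-empty subsets. Division by $n^k$ and taking the limit $n \to \infty$ yields for all $k \in \mathbb{N}_0$

$$\begin{aligned}
\lim_{n\to\infty} E\Big(\Big(\frac{M_n}{n}\Big)^k\Big)
&= \sum_{j=1}^{k} S(k,j) \lim_{n\to\infty} \frac{(n)_j}{n^k}\, P(E_{n,1} \cap \cdots \cap E_{n,j}) \\
&= \lim_{n\to\infty} P(E_{n,1} \cap \cdots \cap E_{n,k}) \\
&= E\big((1 - e^{-rL_1}) \cdots (1 - e^{-rL_k})\big) =: \mu_k.
\end{aligned}$$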

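For all $m, k \in \mathbb{N}_0$, by the exchangeability of $L_1, L_2, \ldots$,

$$\sum_{i=0}^{m} (-1)^i \binom{m}{i} \mu_{k+i} = E\Big(\prod_{i=1}^{k} (1 - e^{-rL_i}) \prod_{i=k+1}^{k+m} e^{-rL_i}\Big) \ge 0.$$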




Thus (Hausdorff moment problem), the sequence $(\mu_k)_{k\in\mathbb{N}}$ is a moment sequence of some random variable $M$ taking values in the unit interval $[0, 1]$. The convergence of moments implies the convergence $M_n/n \overset{d}{\to} M$. □
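The step from the $k$-fold sum over indices to the sum over $j$ in the proof above rests on the identity $n^k = \sum_{j=1}^{k} S(k,j)\,(n)_j$, which groups index tuples by their number of distinct entries. A small sketch (function names chosen here for illustration) checks this identity numerically.

```python
def stirling_second(k_max):
    """Table S[k][j] of Stirling numbers of the second kind, 0 <= j <= k <= k_max."""
    S = [[0] * (k_max + 1) for _ in range(k_max + 1)]
    S[0][0] = 1
    for k in range(1, k_max + 1):
        for j in range(1, k + 1):
            S[k][j] = S[k - 1][j - 1] + j * S[k - 1][j]
    return S


def falling(n, j):
    """Descending factorial (n)_j = n (n-1) ... (n-j+1)."""
    out = 1
    for i in range(j):
        out *= n - i
    return out


if __name__ == "__main__":
    S = stirling_second(6)
    n, k = 7, 4
    print(n**k, sum(S[k][j] * falling(n, j) for j in range(1, k + 1)))
```

Since $(n)_j/n^k \to 0$ for $j < k$ and $(n)_k/n^k \to 1$, only the term $j = k$ (with $S(k,k) = 1$) survives in the limit.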

Remark. There is the following interpretation of the distribution of the limiting external branch lengths $L_i$, $i \in \mathbb{N}$, in terms of the frequency spectrum of the coalescent. Let $S_t$ denote the frequency of singletons of $R_t$. Conditional on $S_{t_1}, \ldots, S_{t_k}$, the probability that $i$ is still a singleton at time $t_i$, $i \in \{1, \ldots, k\}$, is $S_{t_1} \cdots S_{t_k}$. Therefore, for $t_1, \ldots, t_k \in [0, \infty)$,

$$P(L_1 > t_1, \ldots, L_k > t_k) = E(S_{t_1} \cdots S_{t_k}),$$

or, equivalently (in agreement with the principle of inclusion and exclusion),

$$P(L_1 \le t_1, \ldots, L_k \le t_k) = E((1 - S_{t_1}) \cdots (1 - S_{t_k})).$$

Thus, the distribution function of $(L_1, \ldots, L_k)$ can be expressed in terms of the process $S = (S_t)_{t\ge 0}$.

The following Corollary 3.2 expresses the distribution of the limiting random variable $M$ appearing in Lemma 3.1 in terms of the process $(S_t)_{t\ge 0}$. There is the following rough intuition for the form of the integral in Corollary 3.2. A contribution to $M_n$ occurs every time a lineage that has not yet coalesced experiences its first mutation. The time of a first mutation is exponentially distributed with parameter $r$, so at each time $t$ the infinitesimal growth of $M_n$ due to a not yet coalesced lineage is $re^{-rt}$. Since $S_t$ is the fraction of singletons at time $t$, the infinitesimal growth of $M_n$ at time $t$ is approximately $re^{-rt} n S_t$. In [21], when the number of segregating sites is the quantity of interest, any mutation contributes to the count rather than just the first one, so we get $r$ in Proposition 5.1 of [21] in place of the $re^{-rt}$ in Corollary 3.2.



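Corollary 3.2 The limiting variable $M$ appearing in Lemma 3.1 satisfies

$$M \overset{d}{=} r \int_0^\infty e^{-rt} S_t\, dt.$$

Proof. Fix $k \in \mathbb{N}$ and define $g : \mathbb{R}^k \to \mathbb{R}$ via $g(t) := (-1)^k e^{-r(t_1 + \cdots + t_k)}$ for $t = (t_1, \ldots, t_k) \in \mathbb{R}^k$. Note that $h(t) := \frac{\partial^k}{\partial t_1 \cdots \partial t_k} g(t) = r^k e^{-r(t_1 + \cdots + t_k)}$. For $x = (x_1, \ldots, x_k)$, $y = (y_1, \ldots, y_k) \in \mathbb{R}^k$ with $x_i \le y_i$ for all $1 \le i \le k$ define

$$\Delta_x^y g := \sum_{\varepsilon_1,\ldots,\varepsilon_k\in\{0,1\}} (-1)^{\varepsilon_1 + \cdots + \varepsilon_k}\, g(\varepsilon_1 x_1 + (1 - \varepsilon_1) y_1, \ldots, \varepsilon_k x_k + (1 - \varepsilon_k) y_k).$$

With the notation $L := (L_1, \ldots, L_k)$ we have

$$\begin{aligned}
E(M^k) &= E\big((1 - e^{-rL_1}) \cdots (1 - e^{-rL_k})\big) = E(\Delta_0^L g) \\
&= \int_{\mathbb{R}^k} \Delta_0^y g\, P_L(dy) = \int_{\mathbb{R}^k} \int_{\mathbb{R}^k} 1_{[0,y)}(t)\, h(t)\, \lambda^k(dt)\, P_L(dy).
\end{aligned}$$

An application of Fubini's theorem yields

$$\begin{aligned}
E(M^k) &= \int_{[0,\infty)^k} h(t) \int_{\mathbb{R}^k} 1_{(t,\infty)}(y)\, P_L(dy)\, \lambda^k(dt) = \int_{[0,\infty)^k} h(t)\, P(L > t)\, \lambda^k(dt) \\
&= \int_{[0,\infty)^k} r^k e^{-r(t_1 + \cdots + t_k)}\, E(S_{t_1} \cdots S_{t_k})\, \lambda^k(dt_1, \ldots, dt_k) \\
&= E\Big(\Big(r \int_0^\infty e^{-rt} S_t\, dt\Big)^k\Big).
\end{aligned}$$

Thus, the moments of the random variables $M$ and $r \int_0^\infty e^{-rt} S_t\, dt$ coincide. As both random variables take almost surely values in the unit interval $[0, 1]$, they are equal in distribution. □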

The moments of $M$ can be expressed in terms of the measure $\Xi$ as follows.

Remark. Assume that the measure $\Xi$ of the exchangeable coalescent $(R_t)_{t\ge 0}$ satisfies (10). From the Poisson construction of the $\Xi$-coalescent (see Schweinsberg [26]) it follows that the process $X = (X_t)_{t\ge 0}$, defined via $X_t := -\log S_t$ for $t \ge 0$, is a drift-free subordinator with Laplace exponent

$$\Phi(\eta) = \int_{\Delta\setminus\{0\}} \frac{1 - (1-|x|)^\eta}{(x,x)}\, \Xi(dx), \qquad \eta \ge 0.$$
Note that, for $\eta \in \mathbb{N}$, $e^{-t\Phi(\eta)} = E(e^{-\eta X_t}) = E(S_t^\eta)$ is the probability that $\{1\}, \ldots, \{\eta\}$ are (singleton) blocks of $R_t$. The Lévy measure $\varrho$ on $(0, \infty]$ of the subordinator $X$ is hence the image of the measure $\nu(dx) := \Xi(dx)/(x,x)$ via the transformation $T(x) := -\log(1 - |x|)$, i.e. $\varrho(A) = \int_{T^{-1}(A)} (x,x)^{-1}\, \Xi(dx)$ for all Borel subsets $A$ of $(0, \infty]$. This result is in agreement with Proposition 26 of Pitman [23] for the special situation when the coalescent allows only for multiple collisions ($\Lambda$-coalescent). From

$$\int_{\Delta\setminus\{0\}} \frac{|x|}{(x,x)}\, \Xi(dx) = \int_{\Delta\setminus\{0\}} |x|\, \nu(dx) = \int_{(0,\infty]} (1 - e^{-y})\, \varrho(dy)$$

and $(1 - e^{-1}) \min(y, 1) \le 1 - e^{-y} \le \min(y, 1)$, $y \ge 0$, it follows that (10) is equivalent to

$$\Xi(\{0\}) = 0 \quad\text{and}\quad \int_{(0,\infty]} \min(y, 1)\, \varrho(dy) < \infty. \tag{17}$$

Note that the finiteness of the last integral is the typical condition for a measure $\varrho$ to be the Lévy measure of some subordinator. From Proposition 3.1 of [7] it follows that $M \overset{d}{=} r \int_0^\infty e^{-rt - X_t}\, dt$ has moments

$$E(M^k) = \frac{r^k\, k!}{(r + \Phi(1))(2r + \Phi(2)) \cdots (kr + \Phi(k))}, \qquad k \in \mathbb{N}. \tag{18}$$

In particular,

$$\begin{aligned}
\mathrm{Var}(M) &= E(M^2) - (E(M))^2 \\
&= \frac{2r^2}{(r + \Phi(1))(2r + \Phi(2))} - \frac{r^2}{(r + \Phi(1))^2} \\
&= \frac{2r^2 (r + \Phi(1)) - r^2 (2r + \Phi(2))}{(r + \Phi(1))^2 (2r + \Phi(2))} \\
&= \frac{r^2}{(r + \Phi(1))^2 (2r + \Phi(2))} \int_{\Delta\setminus\{0\}} \frac{|x|^2}{(x,x)}\, \Xi(dx),
\end{aligned} \tag{19}$$

as $2\Phi(1) - \Phi(2) = \int_{\Delta\setminus\{0\}} (|x|^2/(x,x))\, \Xi(dx)$.



In the final remark of this section a distributional fixed-point equation for $M$ is derived for $\Xi$-coalescents satisfying

$$\Xi(\{0\}) = 0 \quad\text{and}\quad \int_{\Delta\setminus\{0\}} \frac{\Xi(dx)}{(x,x)} < \infty. \tag{20}$$

In the spirit of Bertoin and Le Gall [4] we call measures $\Xi$ satisfying (20) simple measures. Note that (20) implies (10).



Remark. If (20) holds, then the Lévy measure $\varrho$ of the subordinator $X = (X_t)_{t\ge 0}$ is finite ($m_0 := \varrho((0, \infty]) = \nu(\Delta \setminus \{0\}) < \infty$), which means that $X$ is a compound Poisson process $X_t = \sum_{i=1}^{N_t} \eta_i$, where $N := (N_t)_{t\ge 0}$ is a homogeneous Poisson process with parameter $m_0$ and $\eta_i$, $i \in \mathbb{N}$, are random variables, independent of each other and of $N$, with common distribution function $y \mapsto P(\eta_i \le y) = m_0^{-1} \varrho((0, y])$. Let $T_1 < T_2 < T_3 < \cdots$ denote the jump times of the Poisson process $N$ and put $T_0 := 0$. Note that $T_{i+1} - T_i$ is exponentially distributed with parameter $m_0$. We have

$$\begin{aligned}
M &= r \int_0^\infty e^{-rt} S_t\, dt = \sum_{i=0}^{\infty} \int_{T_i}^{T_{i+1}} r e^{-rt} S_t\, dt \\
&= \int_0^{T_1} r e^{-rt}\, dt + e^{-\eta_1} \int_{T_1}^{T_2} r e^{-rt}\, dt + e^{-\eta_1 - \eta_2} \int_{T_2}^{T_3} r e^{-rt}\, dt + \cdots \\
&= (1 - e^{-rT_1}) + e^{-\eta_1} (e^{-rT_1} - e^{-rT_2}) + e^{-\eta_1 - \eta_2} (e^{-rT_2} - e^{-rT_3}) + \cdots \\
&= (1 - e^{-rT_1}) + e^{-\eta_1} e^{-rT_1} \big((1 - e^{-r(T_2 - T_1)}) + e^{-\eta_2} (e^{-r(T_2 - T_1)} - e^{-r(T_3 - T_1)}) + \cdots\big) \\
&= B + A(1 - B) M',
\end{aligned}$$

with $A := e^{-\eta_1}$, $B := 1 - e^{-rT_1}$ and $M' \overset{d}{=} M$. Thus, $M$ satisfies the distributional fixed-point equation

$$M \overset{d}{=} B + A(1 - B) M, \tag{21}$$

where $A$ and $B$ are independent (and independent of $M$), $B$ is beta distributed with parameters 1 and $m_0/r$, i.e., $P(B > x) = (1 - x)^{m_0/r}$, $x \in (0, 1)$, and the distribution of $1 - A$ is the image of the measure $\nu_0 := \nu/m_0$ under the transformation $|\cdot| : \Delta \setminus \{0\} \to (0, 1]$, $x \mapsto |x|$. Using an argument similar to that of Vervaat [27] shows that the distribution of $M$ is uniquely determined by the fixed-point equation (21). The distribution of $M$ coincides with the stationary distribution of the process $(Y_n)_{n\in\mathbb{N}_0}$ recursively defined by $Y_0 := 0$ and $Y_{n+1} := A_n (1 - B_n) Y_n + B_n$, where $((A_n, B_n))_{n\in\mathbb{N}_0}$ is a sequence of independent, identically distributed random variables with $(A_n, B_n) \overset{d}{=} (A, B)$. Note that

$$Y_n = \sum_{i=0}^{n-1} B_{n-1-i} \prod_{j=n-i}^{n-1} A_j (1 - B_j) \overset{d}{=} \sum_{i=0}^{n-1} B_i \prod_{j=0}^{i-1} A_j (1 - B_j)$$

and, hence, that $M \overset{d}{=} \sum_{i=0}^{\infty} B_i \prod_{j=0}^{i-1} A_j (1 - B_j)$.



4 The total number of mutated branches 

In order to analyze the total number $N_n$ of mutated branches we need to study $C_n$, the number of collision events that take place in the restricted coalescent process $(\varrho_n R_t)_{t\ge 0}$ until there is just a single block. Note that, in general, $C_n \ge X_n$, the number of jumps. For $\Lambda$-coalescents we have $C_n = X_n$.

Lemma 4.1 Let $R$ be a $\Xi$-coalescent. If (10) holds, then $C_n/n \to 0$ in $L^1$.
Proof. For $n \in \mathbb{N}$ define $a_n := E(C_n)$ for convenience. Note that the sequence $(a_n)_{n\in\mathbb{N}}$ satisfies the recursion $a_1 = 0$ and $a_n = v_n + \sum_{k=1}^{n-1} r_{nk} a_k$ for $n \in \{2, 3, \ldots\}$ with $r_{nk} := P(I_n = k)$, $n, k \in \mathbb{N}$, $k < n$, and $v_n := E(V_n)$, where $V_n$ denotes the number of internal branches starting at the time of the first jump of the restricted coalescent $(\varrho_n R_t)_{t\ge 0}$.

We verify the convergence $C_n/n \to 0$ in $L^1$ by contradiction in analogy to Gnedin's proof of Proposition 3 in [13]. Note that a similar argument is used on p. 219 of [17]. Assume that there exists $\varepsilon > 0$ such that $a_n > n\varepsilon$ for infinitely many values of $n$. Selecting $\varepsilon$ smaller, for any fixed $c$ we can obtain the inequality $a_n > \varepsilon n + c$ for infinitely many values of $n$. Let $n_c$ be the minimum such $n$. Then $n_c \to \infty$ as $c \to \infty$. For $k < n_c$ we have $a_k \le \varepsilon k + c$, which implies

$$\begin{aligned}
\varepsilon n_c + c < a_{n_c} &= v_{n_c} + \sum_{k=1}^{n_c - 1} r_{n_c,k}\, a_k \\
&\le v_{n_c} + c + \varepsilon \sum_{k=1}^{n_c - 1} k\, r_{n_c,k} = v_{n_c} + c + \varepsilon E(I_{n_c}).
\end{aligned}$$

The constant $c$ cancels and it follows that $\varepsilon E(n_c - I_{n_c}) < v_{n_c}$. For $c \to \infty$ we obtain the promised contradiction, as $E(n - I_n)/v_n \to \infty$ as $n \to \infty$ by Corollary 6.5 given in the appendix. Thus, for all $\varepsilon > 0$ there exists $n_0 = n_0(\varepsilon) \in \mathbb{N}$ such that $a_n/n < \varepsilon$ for all $n > n_0$. In other words, $a_n/n \to 0$ as $n \to \infty$. □

We are now able to show that, if (10) holds, then the total number $N_n$ of mutated branches and the number $K_n$ of types both have the same asymptotic behavior as $M_n$ as $n \to \infty$.

Corollary 4.2 Let $(R_t)_{t\ge 0}$ be a $\Xi$-coalescent with mutation rate $r > 0$ satisfying (10). Then $N_n/n \overset{d}{\to} M$ and as well $K_n/n \overset{d}{\to} M$, where $M$ is the random variable defined in Corollary 3.2 with moments (18).

Proof. We have $M_n \le K_n \le N_n + 1$. Thus, by Lemma 3.1 it suffices to verify that $(N_n + 1 - M_n)/n \to 0$ in probability. We even show that $(N_n + 1 - M_n)/n \to 0$ in $L^1$. We have

$$\begin{aligned}
0 \le K_n - M_n &\le N_n + 1 - M_n \\
&= \text{number of non-external mutated branches} + 1 \\
&\le \text{number of non-external branches} + 1 = C_n.
\end{aligned}$$

It remains to note that $C_n/n \to 0$ in $L^1$ by Lemma 4.1. □

Note that Corollary 4.2 in particular finishes the proof of Theorem 1.2.

5 Examples 

In this section we apply Theorem 1.2 to some concrete examples.

Example 1. (Dirac coalescents) Fix a point $c \in \Delta \setminus \{0\}$ and suppose that $\Xi = \delta_c$ is the Dirac measure in $c$. Then, condition (10) holds, as $\int_{\Delta\setminus\{0\}} (|x|/(x,x))\, \Xi(dx) = |c|/(c,c) < \infty$. By Theorem 1.2 all three random variables, $M_n/n$, $K_n/n$ and $N_n/n$, converge in distribution to $M := r \int_0^\infty e^{-rt - X_t}\, dt$, where $X = (X_t)_{t\ge 0}$ is a subordinator with Laplace exponent $\Phi(\eta) = (1 - (1 - |c|)^\eta)/(c,c)$, $\eta \ge 0$. The Lévy measure $\varrho = (1/(c,c))\, \delta_{-\log(1-|c|)}$ is hence the Dirac measure in $-\log(1 - |c|)$ scaled by the factor $1/(c,c)$. We have $\Phi(1) = |c|/(c,c)$ and $\Phi(2) = |c|(2 - |c|)/(c,c)$ and, therefore, by (18) and (19),

$$E(M) = \frac{r}{r + \frac{|c|}{(c,c)}} \tag{22}$$

and

$$\mathrm{Var}(M) = \frac{r^2\, \frac{|c|^2}{(c,c)}}{\big(r + \frac{|c|}{(c,c)}\big)^2 \big(2r + \frac{|c|(2-|c|)}{(c,c)}\big)}. \tag{23}$$

Note that $m_0 := \int_{\Delta\setminus\{0\}} (1/(x,x))\, \Xi(dx) = 1/(c,c) < \infty$, i.e. (20) holds as well. Thus, by (21), $M$ satisfies the distributional fixed-point equation $M \overset{d}{=} B + (1 - |c|)(1 - B) M$, where $B$ is a random variable independent of $M$ and beta-distributed with parameters 1 and $m_0/r = 1/((c,c) r)$. Even for this quite simple situation of Dirac coalescents, it does not seem to be straightforward to find simpler characterizations for the distribution of $M$.
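For Dirac coalescents the first two moments of $M$ can be obtained in two ways: from the moment formula $E(M^k) = r^k k!/\prod_{i=1}^{k}(ir + \Phi(i))$, and by taking expectations in the fixed-point equation $M \overset{d}{=} B + A(1-B)M$ with $A = 1 - |c|$ and $B$ beta distributed with parameters 1 and $\beta := m_0/r$. The sketch below compares both in exact arithmetic. It uses the standard beta moments $E(B) = 1/(1+\beta)$, $E(B^2) = 2/((1+\beta)(2+\beta))$ and $E((1-B)^2) = \beta/(\beta+2)$; the function name and the representation of $c$ by its finitely many nonzero entries are assumptions made for this illustration.

```python
from fractions import Fraction


def dirac_moments(c, r):
    """First two moments of M for Xi = delta_c, computed in two ways.

    c: tuple of the (finitely many) positive entries of the point c, as Fractions
    r: mutation rate, as a Fraction
    Returns ((m1, m2), (f1, f2)): closed-form moments and fixed-point moments.
    """
    abs_c = sum(c)                               # |c|
    cc = sum(x * x for x in c)                   # (c, c)

    def phi(eta):                                # Laplace exponent for Xi = delta_c
        return (1 - (1 - abs_c) ** eta) / cc

    m1 = r / (r + phi(1))
    m2 = 2 * r**2 / ((r + phi(1)) * (2 * r + phi(2)))

    A = 1 - abs_c
    beta = 1 / (cc * r)                          # m_0 / r
    e_b = 1 / (1 + beta)                         # E(B)
    e_b2 = 2 / ((1 + beta) * (2 + beta))         # E(B^2)
    e_1b2 = beta / (beta + 2)                    # E((1 - B)^2)
    e_b1b = beta / ((beta + 1) * (beta + 2))     # E(B (1 - B))
    f1 = e_b / (1 - A * (1 - e_b))               # solves E(M) = E(B) + A E(1-B) E(M)
    f2 = (e_b2 + 2 * A * e_b1b * f1) / (1 - A * A * e_1b2)
    return (m1, m2), (f1, f2)


if __name__ == "__main__":
    print(dirac_moments((Fraction(1, 2), Fraction(1, 4)), Fraction(2, 3)))
```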

Example 2. (beta-coalescents) Let $\Lambda$ be beta distributed with parameters $a > 1$ and $b > 0$, i.e., $\Lambda$ has density $u \mapsto (B(a,b))^{-1} u^{a-1} (1-u)^{b-1}$, $u \in (0, 1)$, with respect to the Lebesgue measure on $(0, 1)$, where $B(\cdot, \cdot)$ denotes the beta function. In this situation we have

$$\int_{[0,1]} u^{-1}\, \Lambda(du) = \frac{B(a-1, b)}{B(a, b)} = \frac{a + b - 1}{a - 1} < \infty.$$

Thus, Theorem 1.2 is applicable and all three random variables $M_n/n$, $K_n/n$ and $N_n/n$ converge in distribution to $M := r \int_0^\infty e^{-rt - X_t}\, dt$ as $n \to \infty$, where $X = (X_t)_{t\ge 0}$ is a subordinator with Laplace exponent

$$\Phi(\eta) = \frac{1}{B(a,b)} \int_0^1 \big(1 - (1-u)^\eta\big)\, u^{a-3} (1-u)^{b-1}\, du.$$

The expansion $1 - (1-u)^\eta = \sum_{i=1}^{\infty} \binom{\eta}{i} (-1)^{i+1} u^i$ yields

$$\Phi(\eta) = \frac{1}{B(a,b)} \sum_{i=1}^{\infty} \binom{\eta}{i} (-1)^{i+1} B(a+i-2, b) = \frac{a+b-1}{a-1} \sum_{i=1}^{\infty} \binom{\eta}{i} (-1)^{i+1} \prod_{j=1}^{i-1} \frac{a - 2 + j}{a + b - 2 + j}, \qquad \eta \ge 0.$$

Note that $\Phi(1) = (a + b - 1)/(a - 1)$ and $\Phi(2) = (a + 2b - 1)/(a - 1)$. The mean and the variance of $M$ can be easily deduced from (18) and (19).
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From g((0,y]) = z/((0, 1 — e y ]) = f, Q 1 _ e - y i u 2 A(du) it follows that the Levy 
measure g of the subordinator X has density y i— > (2? (a, 6)) _1 (1 — e _a ) a_3 (e _a ) f ', 
y € (0, oo), with respect to the Lebesgue measure on (0, oo). If a > 2, then 

/" 2 w j \ (o + 6-l)(o + 6-2) 

m := / u 2 A(du) = ^ '- < oo. 

J [o,i] (a-l)(a-2) 

In this case, by (j2"Tj) . M satisfies the distributional fixed-point equation M 
IS ■ A ; 1 B)M, where A and S are independent (and independent of M), 1 — A 
is beta distributed with parameters a — 2 and 6, and B is beta distributed with 
parameters 1 and mo/r. 
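For integer $\eta$ the binomial series for $\Phi(\eta)$ in Example 2 terminates, so the product form can be evaluated exactly and compared with the beta-function form and with the stated values of $\Phi(1)$ and $\Phi(2)$. The sketch below is illustrative only; the function names are hypothetical, and it assumes $a > 1$ so that every beta function involved is finite.

```python
from fractions import Fraction
from math import comb, gamma, isclose

def phi_series(eta, a, b):
    """Phi(eta) for integer eta >= 1 via the (terminating) product series."""
    total = Fraction(0)
    for i in range(1, eta + 1):
        prod = Fraction(1)
        for j in range(1, i):
            prod *= Fraction(a - 2 + j) / Fraction(a + b - 2 + j)
        total += comb(eta, i) * (-1) ** (i + 1) * prod
    return Fraction(a + b - 1) / Fraction(a - 1) * total

def beta_fn(p, q):
    """Beta function B(p, q) evaluated in floating point."""
    p, q = float(p), float(q)
    return gamma(p) * gamma(q) / gamma(p + q)

def phi_beta_ratio(eta, a, b):
    """Phi(eta) = sum_i C(eta, i) (-1)^(i+1) B(a+i-2, b) / B(a, b)."""
    return sum(comb(eta, i) * (-1) ** (i + 1) * beta_fn(a + i - 2, b)
               for i in range(1, eta + 1)) / beta_fn(a, b)

a, b = Fraction(5, 2), Fraction(3, 2)
print(phi_series(1, a, b), (a + b - 1) / (a - 1))
print(phi_series(2, a, b), (a + 2 * b - 1) / (a - 1))
```

Given $\Phi(1)$ and $\Phi(2)$, the mean and second moment of $M$ would follow from the moment formulas referenced in the text.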

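For special parameter values of $a$ and $b$ the Laplace exponent $\Phi$ can be further simplified. For example, for the $\beta(2-\alpha,\alpha)$-coalescent with $0 < \alpha < 1$,

$$\Phi(\eta) = \frac{1}{1-\alpha} \sum_{i=1}^\infty \binom{\eta}{i}\binom{\alpha-1}{i-1} = \frac{\eta\,\Gamma(\eta+\alpha)}{(1-\alpha)\,\Gamma(\alpha+1)\,\Gamma(\eta+1)}, \qquad \eta \ge 0.$$

Note that, if the conjecture on p. 495 of Basdevant and Goldschmidt [1] is correct, then we have identified (in the notation of [1]) the distribution of the random variable $C_1$, namely $C_1 \overset{d}{=} M$.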
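For the $\beta(2-\alpha,\alpha)$-coalescent, the series for the Laplace exponent collapses by the Chu-Vandermonde identity to a ratio of gamma functions, $\Phi(\eta) = \eta\,\Gamma(\eta+\alpha)/((1-\alpha)\Gamma(\alpha+1)\Gamma(\eta+1))$. The following sketch compares the two expressions numerically for integer $\eta$, where the series terminates. It is an illustration under that assumption, and the helper names are hypothetical.

```python
from math import gamma, isclose

def gen_binom(x, k):
    """Generalized binomial coefficient C(x, k) for real x and integer k >= 0."""
    out = 1.0
    for m in range(k):
        out *= (x - m) / (m + 1)
    return out

def phi_sum(eta, alpha):
    """(1 / (1 - alpha)) * sum_{i=1..eta} C(eta, i) C(alpha - 1, i - 1)."""
    return sum(gen_binom(eta, i) * gen_binom(alpha - 1, i - 1)
               for i in range(1, eta + 1)) / (1 - alpha)

def phi_closed(eta, alpha):
    """eta Gamma(eta + alpha) / ((1 - alpha) Gamma(alpha + 1) Gamma(eta + 1))."""
    return eta * gamma(eta + alpha) / (
        (1 - alpha) * gamma(alpha + 1) * gamma(eta + 1))

for eta in range(1, 6):
    print(eta, phi_sum(eta, 0.5), phi_closed(eta, 0.5))
```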

Example 3. Suppose that the measure $\Xi$ is concentrated on the subset $\Delta^*$ of all points $x \in \Delta$ satisfying $|x| = 1$ and that $m_0 := \int_{\Delta\setminus\{0\}} (1/(x,x))\,\Xi(dx) < \infty$. Concrete examples are the star-shaped coalescent, where $\Xi$ is the Dirac measure in $(1, 0, 0, \ldots)$, or the Poisson-Dirichlet coalescent with parameter $\theta > 0$, where $\Xi$ is assumed to have density $x \mapsto (x,x)$ with respect to the Poisson-Dirichlet distribution with parameter $\theta > 0$. Then, (10) and (20) coincide and are both satisfied. Thus, Theorem 1.2 is applicable, i.e., all three random variables, $M_n/n$, $K_n/n$ and $N_n/n$, converge in distribution to a limiting variable $K$ with moments (12). As the measure $\Xi$ is concentrated on $\Delta^*$, the Laplace exponent $\Phi(\eta) = m_0$ is constant. Therefore, $K$ has moments $E(K^j) = r^j j!/((r+m_0)\cdots(jr+m_0))$, $j \in \mathbb{N}$. It follows that $K$ is beta-distributed with parameters $1$ and $m_0/r$.
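The identification of the limit in Example 3 rests on matching moments: a beta variable with parameters $1$ and $\beta$ has $j$-th moment $\beta\,B(j+1,\beta)$. The sketch below compares this with the moment sequence $r^j j!/((r+m_0)\cdots(jr+m_0))$ for $\beta = m_0/r$. It is a numerical illustration only, and the function names are hypothetical.

```python
from fractions import Fraction
from math import factorial, gamma, isclose

def limit_moment(j, r, m0):
    """r^j j! / ((r + m0)(2r + m0)...(jr + m0)), computed exactly."""
    prod = Fraction(1)
    for i in range(1, j + 1):
        prod *= i * r + m0
    return Fraction(r) ** j * factorial(j) / prod

def beta_moment(j, beta):
    """j-th moment of a beta(1, beta) variable: beta * B(j + 1, beta)."""
    return beta * gamma(j + 1) * gamma(beta) / gamma(j + 1 + beta)

r, m0 = 2, 3
for j in range(1, 5):
    print(j, float(limit_moment(j, r, m0)), beta_moment(j, m0 / r))
```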



6 Appendix 

In this appendix basic results for $\Xi$-coalescents $R = (R_t)_{t \ge 0}$ are derived. We first restrict our attention to coalescents with (only) multiple collisions, as the proofs are in this case less technical. Afterwards we extend the results to $\Xi$-coalescents. Our first result (Lemma 6.1) concerns the number of blocks $I_n$ of the restricted coalescent process $(\varrho_n R_t)_{t \ge 0}$ after its first jump. Note that $I_n$ has distribution (9) and that we define $I_1 := 0$ for convenience. Lemma 6.1 is well known from the literature (see, for example, Schweinsberg [25, Lemma 3]); however, we provide a proof which can be extended to the full class of coalescents with simultaneous multiple collisions (see Lemma 6.3).

Lemma 6.1 Let $R$ be a $\Lambda$-coalescent. Then, for all $n \in \mathbb{N}$,

$$g_n\,E(n - I_n) = \int_{[0,1]} \frac{nu - 1 + (1-u)^n}{u^2}\,\Lambda(du)$$

with continuous extension of the function below the integral for $u \searrow 0$.
Proof. We have

$$g_n\,E(I_n) = \sum_{k=1}^{n-1} k\,g_{nk} = \sum_{k=1}^{n-1} k \int_{[0,1]} \binom{n}{k-1} u^{n-k-1}(1-u)^{k-1}\,\Lambda(du).$$

Substituting $i = k-1$ and interchanging the summation with the integral yields

$$g_n\,E(I_n) = \int_{[0,1]} \sum_{i=0}^{n-2} (i+1)\binom{n}{i} u^{n-i-2}(1-u)^{i}\,\Lambda(du)$$

$$= \int_{[0,1]} \frac{n(1-u) + 1 - n^2 u(1-u)^{n-1} - (n+1)(1-u)^n}{u^2}\,\Lambda(du)$$

$$= \int_{[0,1]} \frac{n + 1 - nu - n^2 u(1-u)^{n-1} - (n+1)(1-u)^n}{u^2}\,\Lambda(du).$$

Now subtract this expression from

$$n g_n = \int_{[0,1]} \frac{n - n(1-u)^n - n^2 u(1-u)^{n-1}}{u^2}\,\Lambda(du). \qquad \Box$$
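Lemma 6.1 can be checked exactly when $\Lambda$ is a Dirac measure at some $u \in (0,1]$, because the integrals then reduce to point evaluations. The sketch below uses the rates $g_{nk} = \binom{n}{k-1} u^{n-k-1}(1-u)^{k-1}$ from the proof and the identity $g_n E(n - I_n) = \sum_{k=1}^{n-1}(n-k)\,g_{nk}$. The choice of a Dirac measure and the function names are assumptions made for illustration.

```python
from fractions import Fraction
from math import comb

def jump_rates(n, u):
    """Rates g_{nk}, k = 1..n-1, for Lambda = Dirac measure at u in (0, 1]."""
    return {k: comb(n, k - 1) * u ** (n - k - 1) * (1 - u) ** (k - 1)
            for k in range(1, n)}

def lhs(n, u):
    """g_n E(n - I_n) = sum_k (n - k) g_{nk}."""
    return sum((n - k) * g for k, g in jump_rates(n, u).items())

def rhs(n, u):
    """Integrand of Lemma 6.1 evaluated at the atom u."""
    return (n * u - 1 + (1 - u) ** n) / u ** 2

u = Fraction(1, 2)
for n in range(1, 6):
    print(n, lhs(n, u), rhs(n, u))
```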

Corollary 6.2 If $\Lambda$ has no mass at zero and $\int_{[0,1]} u^{-1}\,\Lambda(du) < \infty$, then $E(n - I_n) \sim (n/g_n) \int_{[0,1]} u^{-1}\,\Lambda(du) \to \infty$ as $n \to \infty$.

Proof. For $n \in \mathbb{N}$ define the auxiliary function $H(n) := \int_{[0,1]} (1 - (1-u)^n)\,u^{-2}\,\Lambda(du)$. Note that $1 - (1-u)^n \le nu$ for $n \in \mathbb{N}$ and $u \in [0,1]$ and therefore $H(n) \le n \int_{[0,1]} \Lambda(du)/u = nH(1) < \infty$ for all $n \in \mathbb{N}$. By Lemma 6.1, $g_n E(n - I_n) = nH(1) - H(n)$. If we can show that $g_n/n \to 0$ and that $H(n)/n \to 0$ as $n \to \infty$, then

$$E(n - I_n) = \frac{n}{g_n}\left(H(1) - \frac{H(n)}{n}\right) \sim \frac{n}{g_n}\,H(1)$$

and we are done. Since

$$g_n = \int_{[0,1]} \frac{1 - (1-u)^n - nu(1-u)^{n-1}}{u^2}\,\Lambda(du) \le \int_{[0,1]} \frac{1 - (1-u)^n}{u^2}\,\Lambda(du) = H(n),$$

it remains to verify that $H(n)/n \to 0$ as $n \to \infty$. By assumption, the measure $\mu(du) := \Lambda(du)/u$ is finite and has no mass at zero. We have

$$\frac{H(n)}{n} = \int_{[0,1]} \frac{1 - (1-u)^n}{nu}\,\frac{\Lambda(du)}{u} = \int_{[0,1]} f_n(u)\,\mu(du),$$

where $f_n(u) := (1 - (1-u)^n)/(nu)$ for $n \in \mathbb{N}$ and $u \in [0,1]$. Obviously $0 \le f_n \le 1$ for all $n \in \mathbb{N}$ and $f_n$ converges pointwise to zero on $(0,1]$ as $n \to \infty$. Thus, $H(n)/n \to 0$ as $n \to \infty$ by dominated convergence. $\Box$
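The asymptotics in Corollary 6.2 can be illustrated for $\Lambda$ equal to a Dirac measure at $u \in (0,1]$, which has no mass at zero and satisfies $\int u^{-1}\Lambda(du) = 1/u < \infty$. In that case $H(1) = 1/u$, and the ratio of $E(n - I_n)$ to $nH(1)/g_n$ equals $1 - H(n)/(nH(1))$. The sketch below is a floating-point illustration under this assumption, with hypothetical function names.

```python
def ratio(n, u):
    """E(n - I_n) divided by n H(1) / g_n for Lambda = Dirac measure at u."""
    # By Lemma 6.1, g_n E(n - I_n) = (n u - 1 + (1 - u)^n) / u^2, and
    # H(1) = 1 / u, so the ratio simplifies to the expression below.
    return (n * u - 1 + (1 - u) ** n) / (n * u)

def expected_loss(n, u):
    """E(n - I_n) itself, using u^2 g_n = 1 - (1-u)^n - n u (1-u)^(n-1)."""
    g_n_u2 = 1 - (1 - u) ** n - n * u * (1 - u) ** (n - 1)
    return (n * u - 1 + (1 - u) ** n) / g_n_u2

for n in (10, 100, 1000, 10000):
    print(n, ratio(n, 0.3), expected_loss(n, 0.3))
```

The first column of output approaches 1 and the second grows without bound, in line with the statement of the corollary.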

In the following, Lemma 6.1 is extended to $\Xi$-coalescents.

Lemma 6.3 Let $\Xi = a\delta_0 + \Xi_0$ be a finite measure on the infinite simplex $\Delta$ and let $(R_t)_{t \ge 0}$ be a $\Xi$-coalescent. For $n \in \mathbb{N}$ let $I_n$ be the number of equivalence classes (blocks) of the restricted coalescent process $(\varrho_n R_t)_{t \ge 0}$ after its first jump ($I_1 := 0$). Then, for all $n \in \mathbb{N}$,

$$g_n\,E(n - I_n) = a\binom{n}{2} + \int_{\Delta} \left(n|x| - \sum_{i=1}^\infty \bigl(1 - (1-x_i)^n\bigr)\right) \frac{\Xi_0(dx)}{(x,x)}. \tag{24}$$

Proof. Fix $n \in \mathbb{N}$. The first summand on the right hand side in (24) is obvious, because with probability $a = \Xi(\{0\})$, the coalescent behaves as the Kingman coalescent, in which case we have $I_n = n - 1$ and $g_n = \binom{n}{2}$. Thus, without loss of generality we can and do assume that $a = 0$.

In the following we exploit Schweinsberg's [53] Poisson process construction of 
exchangeable coalescents. Note that this construction is essentially equivalent to 
Kingman's [TSj paintbox construction and closely related to the Bernoulli sieve 

M- 

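For given $x \in \Delta$ partition $[0,1)$ into intervals $J_0, J_1, J_2, \ldots$ of lengths $x_0 := 1 - |x|, x_1, x_2, \ldots$, i.e., $J_0 := [0, x_0)$, $J_1 := [x_0, x_0 + x_1)$, $J_2 := [x_0 + x_1, x_0 + x_1 + x_2)$ and so on. Let $U_1, \ldots, U_n$ be independent random variables uniformly distributed on $[0,1)$. For $i \in \mathbb{N}_0$ let

$$X_i := X_i(n) := \sum_{j=1}^n 1_{\{U_j \in J_i\}}$$

denote the number of $U_1, \ldots, U_n$ which fall into the interval $J_i$. Note that $X_i$ is binomially distributed with parameters $n$ and $x_i$ and that $\sum_{i=0}^\infty X_i = n$. Therefore,

$$P\Bigl(\bigcap_{i=1}^\infty \{X_i \le 1\}\Bigr) = P(X_0 = n) + \sum_{l=1}^n \frac{1}{l!} \sum_{\substack{i_1,\ldots,i_l \in \mathbb{N} \\ \text{all distinct}}} P(X_0 = n-l, X_{i_1} = 1, \ldots, X_{i_l} = 1)$$

$$= x_0^n + \sum_{l=1}^n \binom{n}{l} x_0^{n-l} \sum_{\substack{i_1,\ldots,i_l \in \mathbb{N} \\ \text{all distinct}}} x_{i_1} \cdots x_{i_l}.$$

We have

$$g_n\,E(I_n) = \sum_{k=1}^{n-1} k\,g_{nk} = \sum_{k=1}^{n-1} k \int_{\Delta} P\Bigl(X_0 + \sum_{i=1}^\infty 1_{\{X_i \ge 1\}} = k\Bigr) \frac{\Xi_0(dx)}{(x,x)}$$

$$= \int_{\Delta} \Bigl(E(X_0) + \sum_{i=1}^\infty P(X_i \ge 1) - n\,P\Bigl(\bigcap_{i=1}^\infty \{X_i \le 1\}\Bigr)\Bigr) \frac{\Xi_0(dx)}{(x,x)}. \tag{25}$$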
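Now subtract this expression from (see Schweinsberg [55, p. 36, Eq. (70)])

$$n g_n = \int_{\Delta} n\Bigl(1 - x_0^n - \sum_{l=1}^n \binom{n}{l} x_0^{n-l} \sum_{\substack{i_1,\ldots,i_l \in \mathbb{N} \\ \text{all distinct}}} x_{i_1} \cdots x_{i_l}\Bigr) \frac{\Xi_0(dx)}{(x,x)}$$

$$= \int_{\Delta} \Bigl(n - n\,P\Bigl(\bigcap_{i=1}^\infty \{X_i \le 1\}\Bigr)\Bigr) \frac{\Xi_0(dx)}{(x,x)}$$

and note that $E(X_0) = n(1 - |x|)$ and that $P(X_i \ge 1) = 1 - (1 - x_i)^n$. $\Box$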
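The integrand of (24) can be checked by brute force for a point $x$ with finitely many non-zero entries: enumerate all assignments of $U_1, \ldots, U_n$ to the intervals $J_0, J_1, \ldots, J_m$, count the blocks $Y = X_0 + \#\{i \ge 1 : X_i \ge 1\}$ that remain, and compare $E(n - Y)$ with $n|x| - \sum_i (1 - (1-x_i)^n)$. The sketch below does this in exact arithmetic. The restriction to finitely many entries and the function names are assumptions for illustration. For $\Xi$ a Dirac measure at $x$, multiplying by $1/(x,x)$ would give $g_n E(n - I_n)$.

```python
from fractions import Fraction
from itertools import product

def expected_block_loss(n, x):
    """E(n - Y), Y = X_0 + #{i >= 1 : X_i >= 1}, by full enumeration."""
    probs = [1 - sum(x)] + list(x)            # lengths x_0, x_1, ..., x_m
    total = Fraction(0)
    for cells in product(range(len(probs)), repeat=n):
        weight = Fraction(1)
        for idx in cells:
            weight *= probs[idx]              # probability of this assignment
        singles = cells.count(0)              # X_0: points landing in J_0
        occupied = len({idx for idx in cells if idx >= 1})
        total += weight * (n - singles - occupied)
    return total

def closed_form(n, x):
    """n |x| - sum_i (1 - (1 - x_i)^n), the integrand of (24)."""
    return n * sum(x) - sum(1 - (1 - xi) ** n for xi in x)

x = (Fraction(1, 2), Fraction(1, 4))
for n in range(1, 6):
    print(n, expected_block_loss(n, x), closed_form(n, x))
```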

For $n \in \mathbb{N} \setminus \{1\}$ we now study the number $V_n$ of internal branches of the restricted coalescent process which start after the time $T_n$ of the first jump of the restricted coalescent process $(\varrho_n R_t)_{t \ge 0}$. Note that $V_n = I_n - S_n$, where $S_n$ denotes the number of singleton blocks of the restricted coalescent process $(\varrho_n R_t)_{t \ge 0}$ after its first jump.

Lemma 6.4 For all $n \in \mathbb{N} \setminus \{1\}$,

$$g_n\,E(V_n) = a\binom{n}{2} + \int_{\Delta} \sum_{i=1}^\infty \bigl(1 - (1-x_i)^n - n x_i (1-x_i)^{n-1}\bigr) \frac{\Xi_0(dx)}{(x,x)}. \tag{26}$$

Proof. Fix $n \in \mathbb{N} \setminus \{1\}$. Again, without loss of generality we can and do assume that $a = 0$. Using the notation of the previous proof it follows that

$$g_n\,E(S_n) = \sum_{s=0}^{n-1} s \int_{\Delta} P\Bigl(X_0 + \sum_{i=1}^\infty 1_{\{X_i = 1\}} = s\Bigr) \frac{\Xi_0(dx)}{(x,x)}$$

$$= \int_{\Delta} \Bigl(E(X_0) + \sum_{i=1}^\infty P(X_i = 1) - n\,P\Bigl(\bigcap_{i=1}^\infty \{X_i \le 1\}\Bigr)\Bigr) \frac{\Xi_0(dx)}{(x,x)}.$$

If we subtract this quantity from the expression (25) already derived for $g_n E(I_n)$ we arrive at

$$g_n\,E(V_n) = \int_{\Delta} \sum_{i=1}^\infty P(X_i \ge 2)\,\frac{\Xi_0(dx)}{(x,x)}$$

and the lemma follows from $P(X_i \ge 2) = 1 - (1 - x_i)^n - n x_i (1 - x_i)^{n-1}$. $\Box$
Remark. Fix $n \in \mathbb{N} \setminus \{1\}$. For $\Lambda$-coalescents, (26) reduces to

$$g_n\,E(V_n) = \int_{[0,1]} \frac{1 - (1-x)^n - nx(1-x)^{n-1}}{x^2}\,\Lambda(dx) = g_n.$$

Thus, $E(V_n) = 1$, which is clear, as $V_n = 1$ for coalescents with only multiple (no simultaneous multiple) collisions.
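The integrand of (26) and the remark above can be checked by the same kind of enumeration: for a point $x$ with finitely many non-zero entries, the expected number of intervals $J_i$, $i \ge 1$, that receive at least two of the $n$ uniform points should equal $\sum_i (1 - (1-x_i)^n - n x_i(1-x_i)^{n-1})$. The sketch below is an exact-arithmetic illustration. The finite support of $x$ and the function names are assumptions.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def expected_internal(n, x):
    """E(#{i >= 1 : X_i >= 2}) by enumerating all assignments of n points."""
    probs = [1 - sum(x)] + list(x)            # lengths x_0, x_1, ..., x_m
    total = Fraction(0)
    for cells in product(range(len(probs)), repeat=n):
        weight = Fraction(1)
        for idx in cells:
            weight *= probs[idx]
        counts = Counter(cells)
        # Non-singleton blocks: intervals J_i, i >= 1, hit at least twice.
        merged = sum(1 for idx, cnt in counts.items() if idx >= 1 and cnt >= 2)
        total += weight * merged
    return total

def closed_form(n, x):
    """sum_i (1 - (1 - x_i)^n - n x_i (1 - x_i)^(n-1)), integrand of (26)."""
    return sum(1 - (1 - xi) ** n - n * xi * (1 - xi) ** (n - 1) for xi in x)

x = (Fraction(1, 2), Fraction(1, 4))
for n in range(2, 6):
    print(n, expected_internal(n, x), closed_form(n, x))
```

For a single non-zero entry $x = (u)$ the quantity is just the probability that at least two points fall into $J_1$, i.e. the probability of a merger, which is the content of the remark.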

Corollary 6.5 If (10) holds, then $\lim_{n\to\infty} E(n - I_n)/E(V_n) = \infty$.

Proof. Define the auxiliary function $H : \mathbb{N} \to \mathbb{R}$ via

$$H(n) := \int_{\Delta\setminus\{0\}} \sum_{i=1}^\infty \bigl(1 - (1-x_i)^n\bigr) \frac{\Xi(dx)}{(x,x)}, \qquad n \in \mathbb{N}.$$

Note that $1 - (1-x_i)^n \le n x_i$ for $n \in \mathbb{N}$ and $x_i \in [0,1]$, and, therefore,

$$0 \le H(n) \le n \int_{\Delta\setminus\{0\}} |x|\,\frac{\Xi(dx)}{(x,x)} = nH(1) < \infty.$$

We rewrite (24) in terms of the auxiliary function $H$ as $g_n E(n - I_n) = nH(1) - H(n)$. Moreover, from (26) it follows that $g_n E(V_n) \le H(n)$. Thus,

$$\frac{E(n - I_n)}{E(V_n)} \ge \frac{nH(1) - H(n)}{H(n)} = \frac{nH(1)}{H(n)} - 1.$$

It remains to verify that $\lim_{n\to\infty} H(n)/n = 0$. By assumption, the measure $\mu(dx) := (|x|/(x,x))\,\Xi(dx)$ is finite and has no mass at zero. We have

$$\frac{H(n)}{n} = \int_{\Delta\setminus\{0\}} f_n(x)\,\mu(dx),$$

where $f_n(x) := \sum_{i=1}^\infty (1 - (1-x_i)^n)/(n|x|)$ for $n \in \mathbb{N}$ and $x \in \Delta \setminus \{0\}$. From $1 - (1-x_i)^n \le n x_i$ for $x_i \in [0,1]$ it follows that $0 \le f_n \le 1$ for all $n \in \mathbb{N}$. It is shown below that $f_n$ converges pointwise to zero on $\Delta \setminus \{0\}$ as $n \to \infty$. Therefore, $H(n)/n \to 0$ as $n \to \infty$ by dominated convergence and the corollary is established. In order to verify the pointwise convergence of $f_n$ to zero fix $x \in \Delta \setminus \{0\}$ and let $\varepsilon_{\mathbb{N}}$ denote the counting measure on $\mathbb{N}$. We have

$$|x|\,f_n(x) = \sum_{i=1}^\infty \frac{1 - (1-x_i)^n}{n} = \int g_n\,d\varepsilon_{\mathbb{N}}$$

with $g_n : \mathbb{N} \to \mathbb{R}$ defined via $g_n(i) := (1 - (1-x_i)^n)/n$. Obviously $g_n \to 0$ pointwise as $n \to \infty$, as $0 \le g_n \le 1/n$ for all $n \in \mathbb{N}$. Moreover, $g_n(i) \le x_i =: g(i)$ for all $n \in \mathbb{N}$. The function $g$ is integrable with respect to the counting measure $\varepsilon_{\mathbb{N}}$ ($\int g\,d\varepsilon_{\mathbb{N}} = \sum_{i=1}^\infty x_i = |x| \le 1$). Thus, $f_n(x) \to 0$ as $n \to \infty$ by dominated convergence. $\Box$

Acknowledgement. The authors thank Alex Iksanov for fruitful comments concerning the remark before Corollary 3.2 and the final remark of Section 3.